Language and Pronunciation Modeling in the CMU 1996 Hub 4 Evaluation
Authors
Abstract
We describe several language and pronunciation modeling techniques that were applied to the 1996 Hub 4 Broadcast News transcription task. These include topic adaptation, the use of remote corpora, vocabulary size optimization, n-gram cutoff optimization, modeling of spontaneous speech, handling of unknown linguistic boundaries, higher order n-grams, weight optimization in rescoring, and lexical modeling of phrases and acronyms.
Similar Resources
The 1996 Hub-4 Sphinx-3 System
This paper describes the CMU Sphinx-3 system and the configuration we used for the 1996 DARPA (Hub-4) evaluation. The model structure, acoustic modeling, language modeling, lexical modeling, and system structure are summarized. We also discuss the experimental results obtained with this system on the most recent DARPA evaluation, along with some subsequent results.
Acoustic Analysis of Persian EFL Learners' Pronunciation of English Vowels
This paper reports the results of an experimental study on non-native production of English vowels. Two groups of Persian EFL learners varying in language proficiency were tested on their ability to produce the nine plain vowels of American English. Vowel production accuracy was assessed by means of acoustic measurements. Ladefoged and Maddieson’s (1996) F1 F2 measurements for American English v...
Modeling Pronunciation of OOV Words for Speech Recognition
This paper presents a technique for modeling pronunciation in automatic speech recognition using an approach based on statistical machine translation. The task of a pronunciation model in speech recognition is to convert a sequence of phonemes into proper words of the language. This task can be realized as a machine translation approach, whereby the source language is a sequence of phonemes and...
Pronunciation Barriers and Computer Assisted Language Learning (CALL): Coping the Demands of 21st Century in Second Language Learning Classroom in Pakistan
Pronunciation of the English language is a very important sub-skill of the speaking module in the second language learning process. However, it is ignored, neglected, and given hardly any attention by teachers, administrators, and stakeholders, especially in Pakistan. Grammar, vocabulary, and the other linguistic skills such as reading and writing are emphasized whereas pronunciation has never be...
Publication date: 1997